'data.frame': 15190 obs. of 37 variables:
$ imdbRating : chr "8.4" "8.3" "8.4" "8.3" ...
$ ratingCount : chr "40550" "45319" "81007" "37521" ...
$ duration : chr "3240" "5700" "9180" "6420" ...
$ nrOfWins : chr "1" "2" "3" "1" ...
$ nrOfNominations : chr "0" "1" "4" "1" ...
$ nrOfPhotos : chr "19" "35" "67" "53" ...
$ nrOfNewsArticles: chr "96" "110" "428" "123" ...
$ nrOfUserReviews : int 85 122 376 219 186 254 211 180 653 226 ...
$ nrOfGenre : int 3 3 2 3 3 3 2 2 3 1 ...
$ Action : int 0 0 0 1 0 0 0 0 0 0 ...
$ Adult : int 0 0 0 0 0 0 0 0 0 0 ...
$ Adventure : int 0 1 0 1 0 0 0 0 0 0 ...
$ Animation : int 0 0 0 0 0 0 0 0 0 0 ...
$ Biography : int 0 0 0 0 0 0 0 0 0 0 ...
$ Comedy : int 1 1 0 1 1 0 1 1 0 0 ...
$ Crime : int 0 0 0 0 0 1 0 0 0 0 ...
$ Documentary : int 0 0 0 0 0 0 0 0 0 0 ...
$ Drama : int 1 0 1 0 1 1 0 1 1 1 ...
$ Family : int 1 1 0 0 0 0 0 0 0 0 ...
$ Fantasy : int 0 0 0 0 0 0 0 0 0 0 ...
$ FilmNoir : int 0 0 0 0 0 0 0 0 0 0 ...
$ GameShow : int 0 0 0 0 0 0 0 0 0 0 ...
$ History : int 0 0 0 0 0 0 0 0 0 0 ...
$ Horror : int 0 0 0 0 0 0 0 0 0 0 ...
$ Music : int 0 0 0 0 0 0 0 0 0 0 ...
$ Musical : int 0 0 0 0 0 0 0 0 0 0 ...
$ Mystery : int 0 0 0 0 0 0 0 0 0 0 ...
$ News : int 0 0 0 0 0 0 0 0 0 0 ...
$ RealityTV : int 0 0 0 0 0 0 0 0 0 0 ...
$ Romance : int 0 0 0 0 1 0 1 0 1 0 ...
$ SciFi : int 0 0 1 0 0 0 0 0 0 0 ...
$ Short : int 0 0 0 0 0 0 0 0 0 0 ...
$ Sport : int 0 0 0 0 0 0 0 0 0 0 ...
$ TalkShow : int 0 0 0 0 0 0 0 0 0 0 ...
$ Thriller : int 0 0 0 0 0 1 0 0 0 0 ...
$ War : int 0 0 0 0 0 0 0 0 1 0 ...
$ Western : int 0 0 0 0 0 0 0 0 0 0 ...
From first, I have taken this data set from kaggle, as poorly formatted data set. Fortunately most of the data set were well sorted according to table. But still there were observations which were not rightly formatted, so we had to lose all of them.
source: kaggle website for dataset.
imdbRating ratingCount duration nrOfWins
Min. :1.000 Min. : 2.4 Min. : 2 Min. : 0.000
1st Qu.:6.300 1st Qu.: 492.0 1st Qu.: 3600 1st Qu.: 0.000
Median :7.000 Median : 3642.0 Median : 5700 Median : 0.000
Mean :6.864 Mean : 25890.9 Mean : 5799 Mean : 9.918
3rd Qu.:7.600 3rd Qu.: 19885.0 3rd Qu.: 6660 3rd Qu.: 2.000
Max. :9.900 Max. :1183395.0 Max. :379114 Max. :7620.000
NA's :1608 NA's :1248 NA's :1039 NA's :388
nrOfNominations nrOfPhotos nrOfNewsArticles nrOfUserReviews
Min. : 0.000 Min. : 0.0 Min. : 0.0 Min. : 0.0
1st Qu.: 0.000 1st Qu.: 0.0 1st Qu.: 0.0 1st Qu.: 3.0
Median : 0.000 Median : 6.0 Median : 8.0 Median : 29.0
Mean : 5.489 Mean : 23.3 Mean : 245.7 Mean : 103.6
3rd Qu.: 3.000 3rd Qu.: 25.0 3rd Qu.: 96.0 3rd Qu.: 102.0
Max. :2011.000 Max. :2810.0 Max. :32345.0 Max. :4928.0
NA's :34 NA's :7 NA's :1
I will present graphs for all the variables individually in order to understand about it’s behavior.
Note:
dependent variable is asymmetric, rightly skewed.
Note:
scaling some of the independent variable
would be good idea to implement in order
to get control on sensitivity or have clear
picture about the happening.\(Correlation-Matrix\)
As described in our
correlation matrix darker
color shows the correlation
among explanatory variables.
For instance variable
ratingCount and
nrOfUserReviews highly cor
related with each other.
Hence one can be removed
from modeling.
---
title: "IMDB Prediction"
output:
flexdashboard::flex_dashboard:
orientation: column
social: ["facebook","twitter"]
theme: yeti
source_code: embed
---
```{r setup, include=FALSE,cache=T}
library(flexdashboard)
suppressPackageStartupMessages( library(dplyr))
suppressPackageStartupMessages(library(ggplot2))
suppressPackageStartupMessages(library(plotly))
library(caret)
```
Data Introductory
====================================
Introduction with data {data-width=400}
----------------------------------
### Importing data set
```{r import part}
imdb <- read.csv("imdb.csv") #import of the data file in R.
rating_pred <- imdb[,-c(1:5,9,10)] #removing redundant
str(rating_pred) #checking structure, since our data is a from kaggle. so this is highly #unformulated
rating_pred <- na.omit(rating_pred) #removing NAs
## Recoding rating_pred$imdbRating into rating_pred$imdbRating
rating_pred$imdbRating <- as.character(rating_pred$imdbRating)
rating_pred$imdbRating[rating_pred$imdbRating == ""] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == " and Gays (TV Episode 2004)"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == " grAs - Die Serie (TV Series 2000â\200 )"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == " hat die Wahl (2000)"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == " Mary (TV Episode 1998)"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == " Paranormal Activity\\"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == " Sons (TV Episode 2006)"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == " Spion (2011)"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == " Winter... und Frühling (2003)"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "achtung fertig charlie"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "alamo der traum das schicksal die legende"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "bergman och filmen bergman och teatern bergman och f r tv movie"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "brows held high shakespeare film and kenneth branagh a retrospective tv episode"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "celebrity sch n reich ber hmt"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "cin mas d horreur apocalypse virus zombies"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "coupling wer mit wem perhaps perhaps perhaps tv episode"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "crazy stupid love"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "deine meine unsere"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "die mommie die"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "dr seltsam oder wie ich lernte die bombe zu lieben"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "easy riders raging bulls how the sex drugs and rock n roll generation saved hollywood"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "ed edd n eddy tv series"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "eins zwei drei"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "einsam zweisam dreisam"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "estrenos cr ticos mientras duermes contagio sin salida tv episode"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "genre mill memories wandering butterflies turkish cats karate warriors and me video"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0009932/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0011237/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0012522/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0013427/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0013442/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0014636/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0015074/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0018051/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0023551/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0024601/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0025452/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0027532/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0029335/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0031385/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0033408/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0033477/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0033563/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0034299/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0034862/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0035795/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0036008/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0036172/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0037101/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0037824/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0037931/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0038059/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0038182/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0038890/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0040580/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0040928/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0041085/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0041113/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0043456/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0043461/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0044084/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0044509/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0044916/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0045125/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0045554/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0045963/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0047562/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0047977/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0048183/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0048545/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0049470/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0049471/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0049762/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0050095/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0052646/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0052893/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0053084/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0053363/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0054390/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0054518/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0054528/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0054642/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0055093/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0055233/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0056173/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0057171/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0057191/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0057193/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0057547/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0058265/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0058437/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0058536/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0059742/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0059749/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0059903/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0060009/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0060143/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0060304/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0060550/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0060556/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0060955/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0061170/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0061176/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0061610/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0061735/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0061791/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0062082/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0062467/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0063152/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0063661/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0063925/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0063950/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0064418/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0065610/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0065797/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0066049/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0066108/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0066364/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0066612/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0066730/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0067402/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0068182/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0068240/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0068286/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0068555/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0069097/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0069495/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0069547/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0069824/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0071411/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0071555/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0072052/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0072973/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0073341/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0074006/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0074851/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0075007/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0075132/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0075323/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0076070/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0076155/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0076451/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0076538/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0076752/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0077975/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0078841/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0080031/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0080129/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0080388/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0080761/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0081353/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0082198/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0082418/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0083715/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0083745/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0084938/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0085933/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0086250/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0086837/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0087365/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0087884/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0089670/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0089907/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0090633/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0090685/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0090852/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0091080/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0091149/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0091209/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0091214/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0092593/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0093105/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0093164/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0093543/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0094641/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0094988/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0094991/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0095595/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0096061/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0097289/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0097523/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0097757/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0097958/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0098360/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0098724/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0098749/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0099850/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0100050/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0100928/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0101120/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0101329/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0101393/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0102027/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0103927/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0104437/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0104974/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0106168/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0106332/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0107209/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0108174/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0108828/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0108927/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0109771/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0110478/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0111579/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0111942/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0112346/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0114108/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0116059/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0116493/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0117284/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0117786/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0117958/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0118004/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0118655/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0118829/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0118843/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0118925/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0119207/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0119229/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0119345/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0119432/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0119465/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0119484/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0120789/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0120834/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0120868/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0125061/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0129884/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0130671/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0139735/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0141163/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0147800/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0162360/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0163187/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0164877/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0168590/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0169858/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0179473/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0183505/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0190590/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0192335/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0197521/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0205461/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0212338/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0234853/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0240684/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0242423/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0247144/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0263975/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0264235/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0265713/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0273855/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0276345/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0290002/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0294097/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0306274/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0317836/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0319061/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0323587/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0337921/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0338261/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0343818/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0345561/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0360201/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0362359/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0364517/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0365285/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0365376/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0367110/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0367478/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0373760/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0378072/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0378284/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0381348/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0383028/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0393735/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0394587/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0401711/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0416046/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0418769/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0421054/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0429589/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0432637/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0433028/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0433383/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0433416/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0433771/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0442268/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0443295/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0443649/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0449514/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0459293/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0461412/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0462501/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0464913/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0466399/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0466642/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0483703/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0485513/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0494716/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0499516/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0504240/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0520589/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0539476/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0550527/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0566900/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0588221/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0595951/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0609115/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0684837/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0699676/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0708502/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0745906/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0748792/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0758774/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0760329/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0770772/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0775349/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0784159/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0800950/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0808399/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0808506/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0815236/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0815353/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0818276/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0819379/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0820911/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0832266/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0836148/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0862856/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0864311/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0874827/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0879221/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0968294/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0969647/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0970416/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0970866/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt0996966/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1017771/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1032815/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1032846/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1067733/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1092633/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1134859/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1137936/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1156312/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1185371/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1189073/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1200078/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1234548/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1237375/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1286537/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1294574/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1303900/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1329457/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1334260/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1336006/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1337117/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1338542/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1353866/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1381010/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1414501/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1421046/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1430966/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1456941/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1488565/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1496905/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1549449/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1586265/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1588334/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1590024/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1601895/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1608777/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1635614/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1649419/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1679204/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1718837/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1740712/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1741225/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1742336/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1814187/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1816777/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1833919/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1860353/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1894193/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1922777/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1923214/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1926567/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1930748/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1934915/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1936736/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1965492/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt1981825/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt2013841/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt2040639/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt2059151/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt2086872/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt2161445/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt2175739/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt2180851/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt2203975/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt2205697/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt2313197/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt2385639/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt2396767/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt2655706/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt2669622/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt2739384/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt2761156/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt3198848/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt3359268/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "http://www.imdb.com/title/tt3567828/"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "jeanne dielman quai du commerce bruxelles"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "johnny guitar gejagt geha t gef rchtet"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "kill daddy kill"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "liebe l ge leidenschaft tv series"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "love wedding marriage ein plan zum verlieben"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "man taraneh panzdah sal daram"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "mary max oder schrumpfen schafe wenn es regnet"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "micky donald goofy die drei musketiere video"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "o vertrauen verf hrung verrat"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "pepi luci bom und der rest der bande"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "phineas und ferb run candace run last train to bustville tv episode"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "psych lock stock some smoking barrels and burton guster s goblet of fire tv episode"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "ready steady cook tv series"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "sprung auf marsch marsch"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "to wong foo thanks for everything julie newmar"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "tomorrow yesterday and today video"] <- NA
rating_pred$imdbRating[rating_pred$imdbRating == "weiblich ledig jung sucht"] <- NA
```
***
From first, I have taken this data set from kaggle, as poorly formatted data set. Fortunately most of the data set were well sorted according to table. But still there were observations which were not rightly formatted, so we had to lose all of them.
> source: kaggle website for dataset.
Summary
----------------------------------
### Summary View (Numeric)
```{r summary part}
numeric <- as.data.frame(apply(rating_pred[,c(1:8)], 2, as.numeric))
#converting variables into numeric and excluding NAs
summary(numeric)
cate <- as.data.frame(apply(rating_pred[,c(9:37)], 2,as.factor))## categorical variables in ### data set
imdb_rating <- as.data.frame(cbind(numeric,cate))%>%
na.omit() #combinning both numeric and categorical variables
```
Dependent Exploration {data-navmenu="Graph"}
===========================================
***
I will present graphs for all the variables individually in order to understand about it's behavior.
`Note:`
dependent variable is asymmetric, rightly skewed.
```{r,cache=TRUE}
p <- imdb_rating %>% na.omit() %>%
ggplot(aes(imdbRating))+
geom_bar(aes(col="red") )+ylab("Count")+xlab("IMDB-Rating")+
ggthemes::theme_tufte()+
geom_vline(xintercept = c(4.5,9))
ggplotly(p)
```
Independant Numeric {data-navmenu="Graph"}
=========================================
***
`Note:`
scaling some of the independent variable
would be good idea to implement in order
to get control on sensitivity or have clear
picture about the happening.
```{r,cache=TRUE}
p1 <- ggplot(imdb_rating, aes(nrOfNominations,imdbRating))+
geom_point()+
geom_jitter(aes(.7))+
ylab("")
# ggplotly(p1)
p2 <- imdb_rating%>%
ggplot(aes(duration,imdbRating))+
geom_point()+
geom_jitter(aes(.7))+
ylab("")
#ggplotly(p2)
p3 <- imdb_rating%>%
ggplot(aes(nrOfWins,imdbRating))+
geom_point()+
geom_jitter(aes(.7))+
ylab("")
#ggplotly(p3)
p4 <- imdb_rating%>%
ggplot(aes(nrOfNewsArticles,imdbRating))+
geom_point()+
geom_jitter(aes(.9)) +
ylab("")
#ggplotly(p4)
p5 <- imdb_rating %>%
ggplot(aes(ratingCount,imdbRating))+
geom_point()+
geom_jitter(aes(.7))+ylab("")
p6 <- imdb_rating %>%
ggplot(aes(nrOfUserReviews,imdbRating))+
geom_point()+
geom_jitter(aes(.7))+
ylab("")
p7 <- imdb_rating %>%
ggplot(aes(nrOfPhotos,imdbRating))+
geom_point()+
geom_jitter(aes(.7))+
ylab("")
gridExtra::grid.arrange(p1,p2,p3,p4,p5,p6,p7,nrow=4)
```
Categorical Graphs {data-navmenu="Graph"}
==================================================
```{r,cache=TRUE}
attach(imdb_rating)
bplot <- function(feature,inde){
p <- ggplot(imdb_rating,aes(feature,imdbRating))+
geom_boxplot(outlier.colour = "red",outlier.shape = 8)+
ggthemes::theme_base()+
theme(
axis.line.y = element_blank(),
#axis.text.y = element_blank(),
axis.title.y = element_blank(),
axis.ticks = element_blank()
)+
#ggtitle("Box-Plot(IMDB~Feature)")+
#ylab("IMDB")+
xlab(inde)
return(p)
}
par(mfrow=c(6,7))
bplot(Thriller,"Thriller")
bplot(nrOfGenre,"No of Genre")
bplot(Action,"Action")
bplot(Adult,"Adult")
bplot(Adventure,"Adventure")
bplot(Animation,"Animation")
bplot(Biography,"Biography")
bplot(Comedy,"Comedy")
bplot(Crime,"Crime")
bplot(Documentary,"Documentary")
bplot(Drama,"Drama")
bplot(Family,"Family")
bplot(Fantasy,"Fantasy")
bplot(FilmNoir,"Filmnoir")
bplot(GameShow,"Gameshow")
bplot(History,"History")
bplot(Horror,"Horror")
bplot(Music,"Music")
bplot(Musical,"Musical")
bplot(Mystery,"Mystery")
bplot(News,"News")
bplot(RealityTV,"RealityTV")
bplot(Romance,"Romance")
bplot(SciFi,"Sci-fi")
bplot(Short,"Short")
bplot(Sport,"Sport")
bplot(TalkShow,"Talkshow")
bplot(War,"War")
bplot(Western,"Western")
```
Correlation {data-icon="fa-pencil"}
===================================================
$Correlation-Matrix$
***
As described in our
correlation matrix darker
color shows the correlation
among explanatory variables.
For instance variable
`ratingCount` and
`nrOfUserReviews` highly cor
related with each other.
Hence one can be removed
from modeling.
```{r}
numeric %>%
na.omit() %>%
cor() %>%
corrplot::corrplot(type = "upper")
```